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ABSTRACT 



Sugarcane is an important tropical cash crop meeting 75% of world sugar demand and it is fast 
becoming an energy crop for the production of bio-fuel ethanol. A considerable area under sugar- 
cane is prone to waterlogging which adversely affects both cane productivity and quality. In an 
effort to elucidate the genes underlying plant responses to waterlogging, a subtractive cDNA library 
was prepared from leaf tissue. cDNA clones were sequenced and annotated for their putative func- 
tions. Major groups of ESTs were related to stress (15%), catalytic activity (13%), cell growth (10%) 
and transport related proteins (6%). A few stress-related genes were identified, including senes- 
cence-associated protein, dehydration-responsive family protein, and heat shock cognate 70 kDa 
protein. A bioinformatics search was carried out to discover novel microRNAs (miRNAs) that can 
be regulated in sugarcane plants subjected to waterlogging stress. Taking advantage of the presence 
of miRNA precursors in the related sorghum genome, seven candidate mature miRNAs were identi- 
fied in sugarcane. The application of subtraction technology allowed the identification of differen- 
tially expressed sequences and novel miRNAs in sugarcane under waterlogging stress. The 
comparative global transcript profiling in sugarcane plants undertaken in this study suggests that 
proteins associated with stress response, signal transduction, metabolic activity and ion transport 
play important role in conferring waterlogging tolerance in sugarcane. 

© 2014 The Authors. Published by Elsevier B.V. on behalf of the Federation of European Biochemical Societies. This 
is an open access article under the CC BY-NC-ND license (http://creativecommons.Org/licenses/by-nc-nd/3.0/). 
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1. Introduction 

Sugarcane, belonging to the genus Saccharum (Poaceae), is an 
important industrial crop accounting for nearly 75% of sugar pro- 
duced worldwide [1]. Being a round the year crop, it has to face 
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a wide range of vagaries of nature, specially biotic and abiotic 
stresses [2], Incessant rains during the monsoon cause flooding 
and waterlogging, especially in low lying areas near the rivers. This 
adversely affects cane productivity and maturity. Thus, a major 
portion of the sugarcane production is lost, causing heavy financial 
loss to the farmers and millers. In sugarcane cultivation, among 
abiotic stresses, waterlogging is an acute problem particularly 
where surface drainage facilities are not adequate [3], The extent 
of waterlogging injury to the plant is dependent on several factors, 
such as flooding depth, duration and flow of water in the field [4,5]. 
The need for improved productivity, coupled with limited 
resources for environmental modification and the availability of 
only a finite amount of land for expansion, points to genetic 
improvement as the major means to augment sugarcane yield. 
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Understanding the physiological and molecular basis of stress tol- 
erance/resistance is crucial in improving the performance of a crop 
especially in stressful environment [6]. 

Considerable progress has been made in understanding the 
molecular, biochemical and physiological basis of traits that enable 
plants to cope with individual stresses. The molecular improve- 
ment approach holds promise in being able to selectively incorpo- 
rate specific traits into superior cultivars in a fraction of the time 
and reducing the effort required to achieve similar results through 
conventional breeding [7,8]. Genome sequencing and expressed 
sequence tags (ESTs) development has great potential in facilitat- 
ing crop improvement for stress resistance. It is becoming realistic, 
for example, to combine large amounts of DNA sequence data with 
high-throughput molecular biology methods to identify genes that 
are differentially expressed under specific environmental condi- 
tions [8,9]. EST collections may be useful as direct sources of genes, 
but their major advantage is in allowing comparisons of expression 
pattern to be made between target tissues. Such evaluations have 
been made highly efficient by the application of array screening 
technologies and these have been gainfully employed in sugarcane, 
using both macroarray and microarray systems [10,11]. Although 
there is no real substitute for a complete genome sequence, cost- 
benefit analysis has proved that ESTs are worth their cost and form 
the core foundations for various genome scale experiments within 
the plant genomes like sugarcane that are tough to assemble due to 
their complexity [12]. A large amount of sugarcane EST resource 
has already been developed and submitted to GenBank that has 
been providing a very valuable means for gene discovery [13,14], 
A drought-inducible gene, designated as SoDip22, has been identi- 
fied in the leaves of sugarcane, using this approach [15]. Data min- 
ing of the SUCEST database with cold inducible genes from other 
plant species identified 33 assembled sequences that were homol- 
ogous [16]. Real-time reverse transcription-PCR profiling of 
selected EST clusters identified several sugarcane clusters that 
show differential expression in response to red rot infection and 
drought stress conditions in sugarcane [17]. 

Small 20-24 nucleotides (nt) non-coding RNAs are important 
regulators of gene expression, causing either transcriptional gene 
silencing (TGS) or post-transcriptional gene silencing (PTGS) 
[18,19]. Several distinct size classes of small RNAs with dedicated 
functions have been identified, including miRNAs [20]. Deep 
sequencing and bioinformatics strategies offer a powerful means 
for quantitative and qualitative profiling of small RNA populations 
and it is convenient for exploring small RNAs in plant species such 
as sugarcane [21-23]. EST analysis has proven to be an economi- 
cally feasible alternative for miRNAs discovery in species lacking 
a draft genome sequence [24]. We used EST analysis and DNA data- 
base analysis in detail to identify new potential miRNA genes and 
their targets. Since the ESTs and expressed genes come from the 
production of actual gene expression, our investigation provides 
more support and confidence in the discovery of new potential 
miRNAs and their targeted genes. 

The information related to gene expression in sugarcane under 
waterlogging stress is still patchy, so the present study was under- 
taken to generate ESTs induced under waterlogging stress and to 
have a glimpse of genes expressed under such stress. In this study, 
we also used a bioinformatics search based on sequence similarity 
or the secondary structure of precursors, and have identified seven 
novel miRNAs expressed in water-logged sugarcane. 

2. Materials and methods 

2.1. Plant material 

In an effort to elucidate the genes underlying plant responses to 
inundation, a waterlogging tolerant popular sugarcane variety, BO 



91, suitable for the North Central zone in India, was grown at 
Indian Institute of Sugarcane Research experimental farm, under 
stress (in lowland area) and normal condition for preparation of 
subtractive cDNA library. This farm has both lowland and upland 
areas. This lowland area gets the runoff water from the upland 
and remains waterlogged during the monsoon. About 30-35 cm 
water level was maintained for one month in the lowland area in 
order to maintain the waterlogging stress. 

2.2. Total and poly (A)+ RNA isolation 

Sugarcane leaf tissues were collected from six month old crop at 
45 days after water inundation and from the plants grown in nor- 
mal condition of same age. Sampling at 45 days was decided arbi- 
trarily to provide ample time for the resistant genotype to express 
the genes for water logging tolerance, and the adverse effect of the 
stress is pronounced and responses of the tolerant plant more vis- 
ible. The sampled tissues were rinsed thoroughly with ice cold 
DEPC treated autoclaved water to prevent the RNA from degrada- 
tion. After washing, the samples were quickly frozen in liquid 
nitrogen and transferred to -80 °C deep freezer for further use. 
Total RNA was isolated by the modified protocol using guanidine 
thiocyanate-phenol [25]. Approximately five-gram sugarcane tis- 
sues were grounded to fine powder in liquid nitrogen and RNA 
was extracted by using GTC salt buffer. RNA was selectively precip- 
itated from the pool of nucleic acid using lithium chloride salt. Poly 
(A)+ RNA was isolated from 200 to 500 ug of total RNA using Oligo 
(dT)-cellulose through affinity chromatography. The 1% denaturing 
agarose gel electrophoresis was carried out to check the quality, 
integrity and size distribution of the RNA, using a submarine hor- 
izontal agarose slab gel. mRNA quantification was also performed 
by UV absorbance at A260 and A280. 

2.3. Subtractive library construction 

The Clontech PCR-Select cDNA Subtraction kit was used for 
preparation of subtractive cDNA library to identify differentially 
expressed transcripts in leaf of water-logged sugarcane. Double- 
stranded cDNA synthesis was carried out on poly (A)+ mRNA 
derived from the leaf tissues of waterlogged stressed (tester) and 
normal (driver) plants. The tester and driver cDNAs were then 
digested with Rsal, which yields blunt end fragments of approxi- 
mately 400 bp length on an average. Both, tester and driver, cDNA 
populations were processed following the manufacturer's instruc- 
tions, with some modifications. The tester cDNA was aliquoted into 
two halves, and each halve was ligated with different cDNA adapt- 
ors. Adapter ligation was followed by two rounds of hybridization 
with an excess of driver cDNA as per manufacturer's protocol. The 
resultant products were subjected to two cycles of PCR with adap- 
tor targeted primers to amplify the desired differentially expressed 
sequences. Amplifications were performed on a MyCycler (BioRad, 
USA). First PCR master mix contained lOx PCR reaction buffer, 
0.2 mM dNTPs, 0.4 uM PCR primer 1 and Advantage cDNA poly- 
merase (Clontech, USA). PCR was performed under the following 
conditions: 94 °C (25 sec) denaturing step followed by 30 cycles 
each consisting of a denaturation step at 94 °C (10 sec), an anneal- 
ing step at 66 °C (30 sec) and an elongation step at 72 °C (1.5 min). 
Before using the primary PCR products as templates for secondary 
PCR, these were diluted 10 fold with sterile water. The second PCR 
master mix contained lOx PCR reaction buffer, 0.2 mM dNTPs, 
0.4 p.M nested PCR primer 1, 0.4 uM nested PCR primer 2 and 
Advantage cDNA polymerase. PCR was run through 20 cycles each 
consisting of a denaturing step at 94 °C (15 sec), an annealing step 
at 66° (30 sec) and an elongation step at 72 °C (1.5 min). cDNA 
molecules were size-selected and fractions larger than 250 bp 
were cloned non-directionally into the pGEM-T-Easy Vector 
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(Promega, USA). The ligated products, i.e., pGEM-T Easy vector con- 
taining insert fragments, were transformed into Escherichia coli 
cells by using electroporation apparatus (Pulse Controller, BioRad, 
USA). Commercially available electro-competent DH10B cells of 
E. coli (GibcoBRL Life Technologies, USA) were used as the hosts. 

2.4. Template preparation 

cDNA libraries were plated onto solid Luria Bertani (LB) med- 
ium with ampicillin, X-GAL and 1PTG to select recombinant clones. 
White colonies were randomly picked to 96 well plates containing 
Freezing Medium (36 mM K 2 HP0 4 , 13.2 mM KH 2 P0 4 , 1.7 mM 
sodium citrate, 0.4 mM MgS0 4 , 6.8 mM (NH 4 ) 2 S0 4 , in LB medium 
and 4.4% glycerol), grown overnight and later stored at -80 °C. 
Recombinant plasmids were isolated using QIAprep Turbo 
Miniprep Kit (QIAGEN, Germany) by employing Genesis 
Workstation 2000 (TECAN, Switzerland) robotic system as recom- 
mended by the supplier. 

2.5. Sequencing: sequence analysis and identification of novel miRNAs 
and their target prediction 

Recombinant plasmids were single-pass sequenced from the 5' 
end. Sequencing reactions were performed with 200 ng of plasmid 
DNA using the AB1 Prism BigDye Terminator Sequencing Kit 
(Applied Biosystems, USA) and analyzed on ABI3700 automated 
DNA analyzer (Applied Biosystems, USA). Primary processing of 
the EST sequences was carried out with the basecaller PHRED 
[26], for base calling and quality trimming. Screening of vector/ 
adapter sequences was carried out with the help of Cross_match 
(www.phrap.org) software. EST sequences having PHRED score 
>20 and longer than 299 bp were submitted to the GenBank. 
However, for the purpose of clustering, size exclusion was not 
done. Resulting high-quality cDNA sequences were assembled into 
contigs using Sequencher v4.6 program (Gene Codes, Ann Arbor, 
Mich, USA) with default parameters. Sequence similarity searches 
were performed against NCBI non-redundant database 
(www.ncbi.nlm.nih.gov/BLAST/) using BLAST internet tool keeping 
all the default parameters. The top scoring hits were annotated for 
Gene Ontology (GO) annotation according to the putative function 
returned by BLASTX. 

EST sequences were subjected to stem-loop structure prediction 
using the Mfold web server (http://mfold.rna.albany.edu/ 
?q=mfold). Predictions were made using RNA sequences contain- 
ing 50-200 nucleotides on either side of the candidate miRNA. In 
case no apparent local foldback structure was predicted for a given 
sequence, larger upstream and downstream sequences were sub- 
mitted for Mfold analysis. The criteria used to identify candidate- 
structured precursors were those suggested by Allen et al, [27]. 
Target genes of the miRNAs were predicted using the online tool 
psRNATarget, a small RNA target analysis server for plants 
(http://plantgrn.noble.org/psRNATarget/). This tool uses an itera- 
tive parallel Smith-Waterman algorithm and a weighted scoring 
scheme in which mismatched bases are penalized according to 
their type and location in the alignment. Mismatches to the 5' 
and central regions of the miRNA were preferentially penalized 
compared with mismatches to the 3' region of the miRNA. 

3. Results and discussion 

3.1. Waterlogging induced morphological and structural modifications 
in sugarcane 

Sugarcane genotype BO 91 was used to prepare a subtractive 
library to study the genes expressed in sugarcane under water- 



logged condition. The selected sugarcane genotype, BO 91, is well 
adapted to areas prone to waterlogging and possesses good toler- 
ance and growth in water-logged condition [3,28]. Development 
of aerial roots and profuse rooting was observed under stressed 
condition but not in normal condition of cultivation (Fig. SI). 
Various morphological attributes were recorded in sugarcane vari- 
ety BO 91 growing in water-logged and normal condition. Mean 
value of all the growth and quality parameters are furnished in 
Table SI. Many unfavourable deviations were observed in most 
of the morphological traits when the cane was grown under water- 
logging condition. Considerable increase was observed in number 
of nodes (60.0%); internode length (17.5%) and nodes with aerial 
root (419.5%) - the characters supposedly conferring tolerance to 
waterlogging stress. Under longer duration of inundation, some 
morphological, anatomical, physiological and biochemical changes 
take place in the plant for sake of adaptation and/or survival [29]. 
Waterlogged sugarcane plants, like those of many species, undergo 
structural changes leading to cell lysis and the formation of aerial 
roots. 

Development of adventitious roots in response to waterlogging 
is considered to be a tolerance mechanism to increase root aeration 
that allows the plant to maintain root functions during flooding 
and to avoid 0 2 deficiency [30]. These above ground roots tend 
to grow horizontally to remain near the water-air interface. 
Sugarcane root system developed a dense mat of aero-tropic, small 
diameter roots when flooded. Significant increase in aerial root for- 
mation was also observed in sugarcane subjected to longer dura- 
tion of waterlogging [4]. 

Increase in internode length and decrease in cane diameter was 
observed when the cane was subjected to longer duration of 
inundation. In many cases, it is this characteristic of the shoot that 
contributes most strongly in securing oxygen when the usual sup- 
ply-route is blocked by flooding or submergence. Achieving unusu- 
ally fast rate of extension to make contact with a source of oxygen, 
and also light and carbon dioxide is a major feature contributing to 
survival in standing water stress conditions [4,5]. 

3.2. Construction of waterlogging-specific subtractive cDNA library 

Different gene expression analyses have provided valuable 
insights into metabolic perturbations and readjustments in plants 
associated with different abiotic stresses such as salt, low/high 
temperatures and drought [31,32]. ESTs generated from cDNA 
libraries should ideally represent all the genes expressed in a target 
organ/tissue, at a specific developmental stage and/or in a specific 
environment. However, variation in expression level of different 
genes in a particular tissue results in mRNAs that differ in abun- 
dance, which makes it difficult to capture the mRNAs of low abun- 
dance, in cDNA libraries. Redundant sequencing of clones 
representing the same expressed genes is another problem that 
lays burden on the efficiency and cost effectiveness of EST sequenc- 
ing [33,34] which hinders research laboratories with small budgets 
to perform EST characterization studies. To overcome this problem, 
different strategies based on normalized cDNA libraries have been 
reported in many organisms including plants [33,35]. Using the 
suppression subtraction hybridization method, Caturla et al. [36] 
identified several genes associated with submergence in adventi- 
tious root primordia of Sesbania. As the co-ordinated regulation 
of multiple genes appears to be an important factor in long-term 
flooding-stress response of plant systems [37] a suppression sub- 
traction hybridization approach was followed in this study for 
understanding how genes express differentially in waterlogging 
tolerant sugarcane genotype BO 91. Sugarcane varieties are highly 
heterozygous complex polyploid hybrid species and any two vari- 
eties differ in the number of chromosomes, thus both would be dif- 
fering at the genomic/functional level for hundreds of genes 
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making it impossible to pin-point the genes of interest. A subtrac- 
tive library from the same variety in the presence or absence of a 
stress targets the genes that are differentially expressed in 
response to that stress. The subtractive cDNA library was prepared 
by subtracting the common genes expressing in normal condition 
from the genes expressing specifically in waterlogged condition. 
A total of 96 clones were sequenced to develop water-logged 
specific sugarcane ESTs, out of which a total of 81 sequences were 
generated, 77 (80%) sequences passed quality check and were 
>100bp. The average cDNA insert size was 500 bp and average 
good quality sequence size was 306 bp. A brief summary of water- 
logging induced subtraction library is given in (Table 1 ). The size 
distribution of ESTs generated from waterlogged specific ESTs 
revealed all the generated 81 sequences were >100 bp and out of 
them 42 sequences were found >300bp (Fig. 1A) and they were 
submitted to NCB1 under the accession numbers (GenBank: 
GT757740 to GT757780). In order to remove redundancy, all the 
sequenced ESTs were assembled using Sequencher program. The 
assembled data was manually verified to remove any mis- 
assembled sequences. The final assembly contained a total of 53 
unigene ESTs with a set of 42 reads assembled as 14 contigs, while 
the remaining 39 sequences were singletons (Fig. IB). 

3.3. Sequence analysis of cDNA fragments derived from subtractive 
hybridization 

Unigene gene sequences were searched using the BLASTX tool 
from the NCB1 non-redundant database to find protein similarity. 
Biological roles were inferred for the genes identified as differen- 
tially expressed in response to waterlogging after submitting the 
sequences to protein database searches (Fig. 2). Fifty-three clones 
identified from the subtraction library in this endeavor could be 
classified into 16 functional categories based on the predicted 
functions (Table 2). The stress related genes were the largest func- 
tional class, representing 1 5% of the genes identified. The second 
largest category was that of sequences encoding the unknown pro- 
teins. These represented 13% of the expressed genes. In this study, 
it was observed that approximately 21% of the identified genes in 
the tolerant cultivar are either unclassified, or encode unknown 
proteins which were much lower than unknown genes (68%) 
identified in rice genotypes tolerant to flooding [38], A significant 
proportion of genes related to catalytic activity (13%) and cell 
growth (10%) were also induced during waterlogging stress. 

Plants have evolved adaptation mechanisms to sense oxygen 
deficiency in their environments and make coordinated physiolog- 
ical and structural adjustments to enhance their hypoxic tolerance 
[5]. It has been demonstrated that the synthesis of several proteins 



Table 1 

Characteristics of the water-logged subtractive ESTs. 



ESTs Count 



cDNA clones sequenced (5' end) 96 

Total sequences generated 81 

Sequences of good quality (>100bp) 77 

Success index (%) 80 

Average insert size (bp) 500 

Average size of good sequences (bp) 306 

Number of singletons 39 

Number of contigs 14 

Number of reads assembled in contigs 42 

Number of unique putative transcripts (unigenes) 53 

Longest unigenes (bp) 719 

Shortest but >100 bp unigene (bp) 117 

Number of assembled ESTs with significant BLASTX match 49 

No significant BLASTX match 4 

Observed redundancy (%) 45 



A 

o >300 | 1 42 

| >200 | 1 62 




Sequence length [bp] 

B 



I 7 fll 




I 1 1 1 1 

0 5 10 

Number of contigs 



Fig. 1. (A) Size distribution of ESTs generated from water-logged leaf cDNAs (B) 
Reads assembled in contigs. 

engaged in the glycolysis and fermentation processes is induced in 
plants exposed to anaerobic conditions. These anaerobic polypep- 
tides include aldolase [39,40], pyruvate decarboxylase [41 ] and 
alcohol dehydrogenase [42-44]. Genes encoding alcohol dehydro- 
genase are reported in the current study (Table 2). Alcohol dehy- 
drogenase 1 transcripts and alcohol dehydrogenase enzyme 
activity were analyzed intensively in maize seedlings and rice 
[45,46]. It has been proposed that a high level of mitochondrial 
ALDH2 protein under anaerobiosis conferred tolerance to submer- 
gence in rice [47]. 

It was interesting to observe the presence of a gene tag SUL21- 
001-Cll-N-082.g for remorins (REMs) protein in the library. This 
protein family has been reported in several plants and animals 
with no distinct function. Specific biological roles of different rem- 
orins from the various groups remain to be investigated, but gene 
expression data suggest that some of these proteins might have 
key functions during responses to biotic and abiotic stimuli and 
might possibly be involved in hormone mediated responses and 
signal transduction. Interestingly, remorins have also been found 
in detergent-resistant membrane fractions, called lipid rafts [48] 
and these are thought to act as docking sites for specific proteins 
involved in many important cellular processes, including secretion, 
signal transduction, and the perception of pathogens [49]. The EST 
cluster SUL2 1 -001 -G01 -N-004.g encoding for actin depolymerizing 
factor (ADF) was obtained in this study. The actin cytoskeleton 
forms a dynamic, stimulus-responsive network in eukaryotic cells 
and is involved in essential functions, including cell division and 
cytoplasmic streaming. Increasing evidence has suggested that 
proteins of the ADF family could be involved in the actin cytoskel- 
eton rearrangement and signal transduction events occurring 
when plants are subjected to stress conditions [50]. We found that 
cluster SUL21-001-C05-N-034.g encoded a protein similar to a 
putative serine/threonine protein kinase from Oryza sativa 
(Table 2). Members of the protein kinase superfamily catalyzes 
the reversible transfer of y-phosphate from ATP to serine, threo- 
nine, or tyrosine on target proteins. Protein kinases have been 
implicated in a large number of plant processes, including the 
development of plant reproductive organs in Petunia hybrida and 
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Fig. 2. Functional characterization of water-logged induced sugarcane leaf unigenes according to the best matched BLAST search results. 

Table 2 

Putative functions associated with ESTs induced during waterlogging in different functional categories. 



Functional Class 


Unigene identifier 


Putative function* 


Amino acid metabolism 


SUL21-001-E04-N-023.g 


Methionine synthase protein 


Catalytic activity 


Contig_7 
Contig_12 

SUL2 1-001 -B04-N-029.g 
SUL21-001-C02-N-006.g 
SUL21-001-G06-N-040.g 
SUL21-001-H06-N-048.g 


Probable cytochrome P450 monooxygenase 

RmlD substrate binding domain containing protein 

UDP-glucose 6-dehydrogenase 

Putative 2-oxoglutarate dehydrogenase E2 subunit 

Diaminopimelate epimerase 

Alcohol dehydrogenase 


Cell differentiation/cellulose degradation 


Contig_l 5 


Exoglucanase precursor 


Cell growth 


Contig 6 

SUL2 1-001 -CI 2-N-082.g 
SUL21-001-D03-N-026.g 
SUL21-001-D04-N-030.g 
SUL21-001-G01-N-004.g 


Alpfia-tubulin 
Actin 

Putative early nodulin 
Actin depolymerizing factor 


DNA metabolism 


SUL21-001-B08-N-045.g 
SUL21-001-H12-N-096.g 


Histone H2A.Z 

DNA methyltransferase 


Plant growth 


SUL21-001-G02-N-008.g 


Auxin-regulated protein-like 


Lipid metabolism 


SUL21-001-B06-N-045.g 
SUL21-001-D07-N-058.g 

SUL21-001-E10-N-071.g 


Putative CLB1 protein (calcium-dependent lipid binding) protein 
Phospholipase D alpha 1 (PLD alpha 1) (choline phosphatase 1) 
(phosphatidylcholine-hydrolyzing phospholipase D 1) 
GDSL-like lipase/acylhydrolase family protein 


Metabolism 


SUL21-001-F01-N-011.g 
SUL21-001-Fll-N-091.g 


Putative phosphoglycerate kinase 
Beta-galactosidase precursor 


Metal ion binding 


SUL21-001-A12-N-085.g 


Metal ion binding 


Protein degradation 


Contig_18 

SUL21-001-C09-N-066.g 


Kelch repeat-containing F-box-like 

26S proteasome regulatory particle triple-A ATPase subunit6 


Signal transduction 


Contig_4 

SUL21-001-C05-N-034.g 


Putative protein kinase homolog 

Putative receptor serine/threonine kinase PR5K 


Stress response 


Contig_2 

SUL21-001-G08-N-056.g 
Contig_l 1 

SUL21-001-D05-N-030.g 
SUL21-001-G09-N-068.g 
SUL21-001-G12-N-088.g 
SUL21-001-H10-N-080.g 


Putative senescence-associated protein 

Dehydration-responsive family protein 

Adhesive/proline-rich protein homolog-like protein 

Legumain precursor 

Heat shock cognate 70 kDa protein 

DnaJ protein, putative 


Sucrose metabolism 


Contig_5 

SUL21-001-H07-N-060.g 


Sucrose synthase 2 (sucrose-UDP glucosyltransferase 2) 


Transport 


Contig_16 

SUL21-001-A02-N-005.g 
SUL21-001-F04-N-031.g 


Calcium-transporting ATPase 2 
Putative cation diffusion facilitator 
Clathrin assembly protein, putative 


Unable to classify 


SUL21-001-C07-N-050.g 
SUL2 1-001 -CI l-N-082.g 


Hypothetical protein~predicted by FGENESH etc. 
Putative remorin 1 protein 



(continued on next page) 
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Table 2 (continued) 



Functional Class 


Unigene identifier 


Putative function* 




SUL21-001-E06-N-039.g 


pnFL-2 




SUL21-001-E12-N-087.g 


rRNA intron-encoded homing endonuclease 




SUL2 1 -001 -F06-N-047.g 


Expressed protein 



Unigene (non redundant) EST sequences identity number. 

* The putative annotation was based on GenBank nr (BLASTX). 



Arabidopsis thaliana [51] and it is reasonable to assume that stress 
induced protein kinases also exist in sugarcane. 

Chaperones are proteins capable of assisting the folding of other 
proteins in vivo [52]. Conditions of stress, such as temperature 

sof-MIR1 

Initial AG = -29.20 kcal/mol 



variation can induce protein aggregation inside cells and conse- 
quently, many chaperones were first identified as heat shock pro- 
teins (Hsps). Chaperones are able to bind to exposed hydrophobic 
residues in unfolded proteins (usually buried in the native state) 



CC CU C - U U U I A-- G 

5' UUCCU UGAA UUGA GA GC CCU CUG CUGGAGA CA A 
3- AGGGA ACUU GACU CU CG GGA GAC GACCUCU GU C 

UA CC C A U C CUGU A AGU AUA A 



sof-MIR2 

Initial AG = -39.00 kcal/mol 

GGAG - | CAAGCA C AU 

5' GCAGCAGCAG AGGGCAGC UCC UCC \ 

3/ UGUCGUCGUC UCUCGUCG AGG AGG G 

GAGG G A CC A AA 

sof-MIR3 

Initial AG = -34.20 kcal/mol [25 nt side loop] 

UCA A G C .-AUCUI A A 

5' UGCCG CAGUG UGA GAGC UGCA GA C 

3- ACGGC GUUAC ACU CUUG ACGU CU C 

CA- A G - \ A C G 



sof-MIR4 

Initial AG = -33.10 kcal/mol [15 nt side loop] 

GCU - GC AGAGAAGGGUG . -GUUA A | G 

5' GGA AUGUUU UCUCCGUG CAC CUC GGA G 

3' CCU UACAAG AGAGGCAC GUG GAG CUU A 

GUC G A- AGAAAGAAGUA \ - A C 

sof-MIR5 

Initial AG = -31.70 kcal/mol 

CA - - U A | CUGA CCCCU AAG 

5' UGC AUGUUGGUUG UG UGC AAGAUCA ACC GGCU GGACA \ 

3' ACG UGUAACUAAC AC GCG UUCUGGU UGG UCGA CUUGU U 

AA U U - G CAUCUUUA" UG~ UU GUU 

sof-MIR6 

Initial AG = -24.10 kcal/mol 

UUC C C- I A A UGGC 

5 , AUC UAGGC CAU GGUGUUGAAA UCC \ 
3, UAG GUCCG GUA CCGUAACUUU AGG A 

UGA U UC A - - CUUA 



sof-MIR7 

Initial AG = -36.70 kcal/mol 

AAA | UUA C U CG — A A 

5' GGUU GCCCU CGUU GCUGGU CCAAAG CGAG \ 
3' CCAG CGGGA GCGA UGACCG GGUUUC GCUU G 

GGG A U - CGAA C C 

Fig. 3. Prediction of secondary structures for novel miRNA candidates under waterlogging condition of sugarcane. The putative miRNA sequences are highlighted in red and 
miRNA* sequences are highlighted in blue. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) 
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and according to [53] such a mechanism prevents the incorrect 
folding of the protein. The expression of chaperone proteins, 
including heat shock and DnaJ (Table 2) may indicate their role 
when sugarcane plant was subjected to water logging stress. Most 
of the chaperone and stress-related protein genes including Hsp70, 
DnaJ, and heat shock transcription factors were also reported in 
sugarcane EST (SUCEST) database [54]. 

Leaf senescence is a regulated process that corresponds to the 
last stage in leaf development. It is characterized by dramatic 
changes in cellular metabolism and the sequential degeneration 
of cellular structures. The occurrence of leaf senescence associated 
proteins confers its role under waterlogging stress in sugarcane in 
the current study and also evidenced in other studies. In addition 
to ageing, multiple developmental and environmental signals are 
able to trigger senescence. Drought, darkness, leaf detachment, 
and the hormones abscisic acid (ABA) and ethylene are reported 
to induce leaf yellowing [55]. Thirteen snow pea pod senescence- 
associated (SSA) cDNAs were isolated from a 5-day stored pod 
cDNA library using differential screening [56]. Sequence compari- 
son indicated that these cDNA clones encoded proteins related to 
cell membrane and nutrient remobilization, to disease response- 
related and ribosomal proteins, and to ubiquitin-conjugating 
enzyme. 

3.4. Identification of microRNAs induced under waterlogging 

The identification of miRNAs and their targets has proven to be 
an important tool in understanding miRNA-dependent gene regu- 
lation in many plant species including sugarcane. A large number 
of microRNAs and their putative targets have been identified in 
sugarcane subjected to drought, salt stress, and pathogen infection 
[21,22,57]. 26 conserved miRNA families and two putative novel 
miRNAs, as well as a number of trans-acting small interfering RNAs 
had been identified in sugarcane axillary bud growth, using bioin- 
formatics approach [58]. Waterlogging is one of the major causes 
of reduced crop yield among the globe [59]. The most radical effect 
caused due to waterlogging is reduced access of oxygen to the rhi- 
zosphere [60]. In Arabidopsis, a widespread change has been 
observed in 5-10% of the transcriptome when subjected to low 
oxygen condition [61,62]. Co-incidentally dramatic changes have 
also been observed in protein synthesis and degradation with cor- 
responding mRNA changes [63]. It has been proved now that 
microRNAs play important role in posttranscriptional modulation 
of gene expression involving in plant responses to abiotic stresses. 
However, the regulation of miRNA in the various morphological 
responses to waterlogging is not very well established in plants 
especially in sugarcane. In maize seedlings, a total of 61 mature 
miRNAs had been identified including 32 waterlogging responsive 
miRNAs, most of these were negatively regulated under waterlog- 
ging condition [64]. In order to understand the molecular mecha- 
nisms regulating gene expression in response to water logging 
we identified putative microRNAs and their targets using computa- 
tional approach. 

The most challenging problem in understanding plant miRNAs 
is identifying novel miRNAs, for which different approaches have 
been used in plants including forward genetics, computational pre- 
diction, and direct cloning and sequencing [19]. To uncover addi- 
tional sugarcane-specific miRNA candidates within our 
sequenced set, in all, 7 small RNAs met our criteria as established 
according to Allen et al. [27] and were considered to be putative 
novel sugarcane miRNAs. We did find both the miRNA and the 
miRNA* sequences (Fig. 3). The detection of miRNAs* is a strong 
clue, albeit not infallible, about the existence of precursor hairpin 
structures, adding credence to the authenticity of the predicted 
candidates. Precursors of these novel miRNAs had negative fold- 



ing-free energies ranging from -39 to -24 kcal mol -1 , with an 
average of about -33 kcal mol -1 according to the Mfold RNA fold- 
ing platform. These values are similar to the free energy values of 
other plant miRNA precursors and are much lower than the 
reported folding-free energies of tRNAs or rRNAs [65]. Finally, 
BLASTN analysis against all nucleotide sequences in the NCBI dat- 
abases revealed no homologs for these miRNAs in other plant spe- 
cies, signifying that these newly identified putative miRNAs are 
specific to sugarcane species. To clarify the biological functions of 
the newly identified sugarcane miRNAs, we searched for putative 
target genes using the psRNATarget program with default parame- 
ters [66]. The target genes found for novel sugarcane miRNAs are 
listed in (Table S2) and most of them code for hypothetical DnaJ 
like proteins. The miRNA-regulation of gene expression plays a 
key role in the development and response to biotic and abiotic 
stresses [67], To date, only 35 sugarcane miRNAs are deposited 
at the miRBase database current version (release 20). The identifi- 
cation of miRNAs and their targets is important not only to help us 
learn more about the roles of miRNAs in sugarcane development 
and physiology, but also to provide a basis for further studies on 
RNAi-based regulation mechanisms under waterlogging stress. 

4. Conclusions 

In summary, the comparative global transcript profiling in sug- 
arcane plants undertaken in this study suggests that transcripts 
associated with stress response, signal transduction, metabolic 
activity and ion transport play an important role in conferring 
waterlogging tolerance in the sugarcane genotype, BO 91. The sen- 
sitivity of the subtraction hybridization technique allowed the 
identification of factors involved in different biological functions. 
The result indicates that the lengths of the sequences reported in 
this study are good enough to retrieve hits in GenBank database. 
Sequences of only two unigenes showed homology to the proteins 
identified in sugarcane which indicates that information related to 
genes expressed during waterlogging stress is inadequate at global 
level. However, in the present study, 53 unique sequences were 
generated from 96 clones which may only give a glimpse of the 
transcriptome expressing in waterlogging tolerant sugarcane 
genotype under study. Further work is needed to consolidate the 
preliminary results and find out the battery of genes involved in 
transient to stable response to stress by sampling at different time 
intervals and various organs that perceive and respond to the 
stress. The cDNA clones obtained from this work need to be further 
characterized using simpler model systems like yeast and E. coli or 
higher systems, particularly gene knockouts or through ectopic 
over-expression of genes using a transgenics approach. miRNA 
analysis was undertaken with the objective to chance upon any 
relevant contributors to stress tolerance. Our findings provide the 
first miRNA discovery in sugarcane under waterlogging stress 
and are suggestive of the potential role of miRNA in regulatory 
pathways of sugarcane and other crops. Future work on these lines 
would definitely enhance our understanding of the genetic basis of 
the waterlogging stress tolerance mechanisms and provide newer 
directions in developing sugarcane varieties tolerant to waterlog- 
ging stress. 
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